Method of operating a network using differentiated pricing and a network configured to operate using differentiated pricing

ABSTRACT

The method of operating a network includes receiving a measurement of total traffic intensity, receiving an expected quality of service associated with a packet, and assigning the packet to one of two or more priority queues based on the measurement of total traffic intensity and the expected quality of service, the two or more priority queues having different delay metrics.

BACKGROUND

The amount of data traffic generated by mobile terminals connected to wireless networks continues to increase as technology progresses. However, the revenue generated from the mobile terminals has not increased at the same pace as costs of serving the mobile terminals. Previous data plans offered unlimited data traffic for a flat rate, while current data plans offer tiers of data usage at varying prices that allow a fixed amount of data per month. Despite this change, the price that users of the mobile terminals are willing to pay per Megabyte of data traffic is decreasing and the revenue generated from the current fixed data plans may not offset the costs of growing and serving the wireless networks. More efficient methods of operating the wireless networks in order to increase revenue and/or decrease the cost of serving the mobile terminals are desired.

SUMMARY

At least one example embodiment is directed to a method of operating a network using differentiated pricing and/or a network configured to operate using differentiated pricing.

According to at least one example embodiment, a method of operating a network includes receiving a measurement of total traffic intensity; receiving an expected quality of service associated with a packet; and assigning the packet to one of two or more priority queues based on the measurement of total traffic intensity and the expected quality of service, the two or more priority queues having different delay metrics.

The method may include servicing the packet from the one of the two or more priority queues.

The method may include measuring the total traffic intensity, and the receiving receives the measured total traffic intensity.

The method may include determining a traffic intensity associated with each of the two or more priority queues and the delay metric associated with each of the two or more priority queues based on the measurement of total traffic intensity, and the assigning assigns the packet to the one of two or more priority queues based on the expected quality of service and the determined traffic intensity associated with each of the two or more priority queues.

The expected quality of service may include a requested priority from a user, the requested priority being two or more priority levels that the user selects.

The receiving the expected quality of service associated with the packet may include receiving an expected quality of service associated with a mobile terminal; receiving the packet from the mobile terminal; and associating the packet with the expected quality of service associated with the mobile terminal.

The method may include determining a price of service for the packet based on the determined the delay metric associated with the one of two or more priority queues.

The delay metric may be an average delay and the determining determines the average delay based on an average service time and an average waiting time.

The delay metric may be a P-th percentile delay and the determining determines the P-th percentile delay based on a probability approximation including a waiting time for the one of two or more priority queues, a number of components of a probability mass function (PMF) associated with a service time, and a resolution of the PMF associated with the service time.

The delay metric may be a truncated average delay approximation and the determining determines the truncated average delay approximation based on a waiting time for the one of two or more priority queues and a delay for the non-prioritized system.

According to at least one example embodiment, a network includes a queue controller configured to receive a measurement of total traffic intensity, to receive an expected quality of service associated with a packet, and to assign the packet to one of two or more priority queues based on the measurement of total traffic intensity and the expected quality of service, the two or more priority queues having different delay metrics.

The network may also include a buffer configured to service the packet from the one of two or more priority queues.

The network may also include a traffic intensity measurement device configured to measure the total traffic intensity and to transmit the total traffic intensity to the queue controller.

The queue controller may be further configured to determine a traffic intensity associated with each of the two or more priority queue and the delay metric associated with each of the two or more priority queues based on the measurement of total traffic intensity, and the queue controller is configured to assign the packet to the one of two or more priority queues based on the expected quality of service and the determined traffic intensity associated with each of the two or more priority queues.

The expected quality of service may include a requested priority from a user, the requested priority being two or more priority levels that the user selects.

The queue controller may be further configured to receive an expected quality of service associated with a mobile terminal, to receive the packet from the mobile terminal and to associate the packet with the expected quality of service associated with the mobile terminal.

The queue controller may be configured to determine a price of service for the packet based on the determined delay metric associated with the one of two or more priority queues.

The delay metric may be an average delay and the queue controller is configured to determine the average delay based on an average service time and an average waiting time.

The delay metric may be a P-th percentile delay and the queue controller is configured to determine the P-th percentile delay based on a probability approximation including a waiting time for the one of two or more priority queues, a number of components of a probability mass function (PMF) associated with a service time, and a resolution of the PMF associated with the service time.

The delay metric may be a truncated average delay approximation and the queue controller is configured to determine the truncated average delay approximation based on a waiting time for the one of two or more priority queues and a delay for the non-prioritized system.

BRIEF DESCRIPTION OF THE DRAWINGS

Example embodiments will become more fully understood from the detailed description given herein below and the accompanying drawings, wherein like elements are represented by like reference numerals, which are given by way of illustration only and thus are not limiting of the present invention, and wherein:

FIG. 1 illustrates a base station that includes a computer system, the base station connecting mobile terminals in a coverage area of the base station to a wireless network, according to some example embodiments;

FIG. 2 illustrates an example structure of the computer system, according to some example embodiments;

FIG. 3 is a flowchart illustrating a method of determining the traffic intensity associated with each priority queue and a delay metric associated with each priority queue, according to some example embodiments;

FIG. 4 is a flowchart illustrating a method of assigning each mobile terminal to a priority queue, according to some example embodiments; and

FIG. 5 is a flowchart illustrating a method of placing a data request from a mobile terminal in the assigned priority queue associated with the mobile terminal, according to some example embodiments.

It should be noted that these figures are intended to illustrate the general characteristics of methods, structure and/or materials utilized in certain example embodiments and to supplement the written description provided below. These drawings are not, however, to scale and may not precisely reflect the precise structural or performance characteristics of any given example embodiment, and should not be interpreted as defining or limiting the range of values or properties encompassed by example embodiments. For example, the relative thicknesses and positioning of molecules, layers, regions and/or structural elements may be reduced or exaggerated for clarity. The use of similar or identical reference numbers in the various drawings is intended to indicate the presence of a similar or identical element or feature.

DETAILED DESCRIPTION

Example embodiments will now be described more fully with reference to the accompanying drawings, in which some example embodiments are shown. Example embodiments may, however, be embodied in many different forms and should not be construed as being limited to the example embodiments set forth herein; rather, these example embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the concept of example embodiments to those of ordinary skill in the art. In the drawings, the thicknesses of layers and regions are exaggerated for clarity. Like reference numerals in the drawings denote like elements, and thus their description will be omitted.

It will be understood that, although the terms “first”, “second”, etc. may be used herein to describe various elements, components, and/or sections, these elements, components, and/or sections should not be limited by these terms. These terms are only used to distinguish one element, component or section from another element, component, or section. Thus, a first element, component, or section discussed below could be termed a second element, component, or section without departing from the teachings of example embodiments.

The terminology used herein is for the purpose of describing particular example embodiments only and is not intended to be limiting of example embodiments. As used herein, the singular forms “a,” “an” and “the” are intended to include the plural forms as well, unless the context clearly indicates otherwise. It will be further understood that the terms “comprises”, “comprising”, “includes” and/or “including,” if used herein, specify the presence of stated features, integers, steps, operations, elements and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components and/or groups thereof.

It should also be noted that in some alternative implementations, the functions/acts noted may occur out of the order noted in the figures. For example, two figures shown in succession may in fact be executed substantially concurrently or may sometimes be executed in the reverse order, depending upon the functionality/acts involved.

Portions of the present invention and corresponding detailed description are presented in terms of software, or algorithms and symbolic representations of operations on data bits within a computer memory. These descriptions and representations are the ones by which those of ordinary skill in the art effectively convey the substance of their work to others of ordinary skill in the art. An algorithm, as the term is used here, and as it is used generally, is conceived to be a self-consistent sequence of steps leading to a desired result. The steps are those requiring physical manipulations of physical quantities. Usually, though not necessarily, these quantities take the form of optical, electrical, or magnetic signals capable of being stored, transferred, combined, compared, and otherwise manipulated. It has proven convenient at times, principally for reasons of common usage, to refer to these signals as bits, values, elements, symbols, characters, terms, numbers, or the like.

It should be borne in mind, however, that all of these and similar terms are to be associated with the appropriate physical quantities and are merely convenient labels applied to these quantities. Unless specifically stated otherwise, or as is apparent from the discussion, terms such as “processing” or “computing” or “calculating” or “determining” or “displaying” or the like, refer to the action and processes of a computer system, or similar electronic computing device, that manipulates and transforms data represented as physical, electronic quantities within the computer system's registers and memories into other data similarly represented as physical quantities within the computer system memories or registers or other such information storage, transmission or display devices.

Some example embodiments will now be described with reference to the attached figures. Various structures, systems and devices are schematically depicted in the drawings for purposes of explanation only and so as to not obscure the present invention with details that are well known to those skilled in the art. Nevertheless, the attached drawings are included to describe and explain some illustrative example embodiments. The words and phrases used herein should be understood and interpreted to have a meaning consistent with the understanding of those words and phrases by those skilled in the relevant art. No special definition of a term or phrase, i.e., a definition that is different from the ordinary and customary meaning as understood by those skilled in the art, is intended to be implied by consistent usage of the term or phrase herein. To the extent that a term or phrase is intended to have a special meaning, i.e., a meaning other than that understood by skilled artisans, such a special definition will be expressly set forth in the specification in a definitional manner that directly and unequivocally provides the special definition for the term or phrase.

As used herein, the term “mobile terminal” may be considered synonymous to, and may hereafter be occasionally referred to, as a phone, wirelessly equipped laptop, a mobile, mobile unit, mobile user, subscriber, user, remote station, access terminal, receiver, etc., and may describe a remote user of wireless resources in a wireless network. The term “base station” (BS) may be considered synonymous to and/or referred to as a base transceiver station (BTS), Node B, etc. and may describe equipment that provides data and/or voice connectivity between a wireless network and one or more users.

FIG. 1 illustrates a base station 110 that includes a computer system 120, the base station 110 connecting mobile terminals 130 in a coverage area of the base station 110 to a wireless network 100, according to some example embodiments.

When a mobile terminal 130 is connected to the base station 110, the mobile terminal 130 may send a data request to wirelessly exchange packets of data with the base station 110. For example, the mobile terminal 130 may send a data request to wirelessly send packets of data to the base station 110 to upload data or send a data request to wirelessly receive packets of data from the base station 110 to download data. The base station 110 may relay the packets of data to the wireless network 100 or a larger network, such as the internet. The base station 110 may receive packets of data from the wireless network 100 or the larger network and relay the received packets of data to the mobile terminal 130. The computer system 120 may control the relay of data packets, as discussed below with regard to FIGS. 2-3.

The example embodiments illustrated in FIG. 1 show a single base station 110 that includes a computer system 120. However, example embodiments are not limited thereto, and a base station 110 may be collocated with an external computer system 120 or multiple base stations 110 may be in communication with a single computer system 120.

FIG. 2 illustrates an example structure of the computer system 120, according to some example embodiments. The computer system 120 may include a traffic intensity measurement device 205, a queue controller 210 and a buffer 220. The queue controller 210 may include a switch 212 and a controller 214.

The total traffic requested by mobile terminals 130 is input to the computer system 120. The total traffic requested by mobile terminals 130 may include all data requests by mobile terminals 130 within a coverage area of the base station 110 to upload or download packets of data. The traffic intensity measurement device 205 may measure the total traffic requested by mobile terminals 130 and provide the controller 214 with a measurement of the total traffic intensity (λ_(b)). The measurement of total traffic intensity (λ_(b)) may be in packets/second, Megabytes/second or other measurements for a rate of data.

The controller 214 may receive an expected quality of service for all mobile terminals 130 active in the coverage area of the base station 110. A mobile terminal 130 is considered active if the mobile terminal 130 is connected to the base station 110.

Based on the expected quality of service for a mobile terminal 130, the controller 214 may associate one of several priority queues with the mobile terminal 130. The switch 212 may assign data requests from the mobile terminal 130 to the priority queue associated with the mobile terminal 130.

The buffer 220 may buffer the assigned data requests until the assigned data requests are serviced and the data requests or packets may be serviced based on the priority queue. For example, the buffer 220 may service all of the data requests or packets in the first priority queue before servicing any of the data requests or packets in the second priority queue.

The computer system 120 may output prioritized traffic, which indicates the order in which each data request from the mobile terminals 130 will be serviced or processed by the base station 110.

FIG. 3 is a flowchart illustrating a method of determining the traffic intensity associated with each priority queue and a delay metric associated with each priority queue, according to some example embodiments.

In S310, the queue controller 210 may receive a measurement of the total traffic intensity (λ_(b)). For example, the queue controller 210 may receive a measurement of the total traffic intensity (λ_(b)) from the traffic intensity measurement device 205. Alternatively, the queue controller 210 may receive a measurement of the total traffic intensity (λ_(b)) from a source external to the computer system 120, such as from the base station 110 or from another device in the wireless network 100. As discussed above, the total traffic intensity (λ_(b)) may be a measurement [for example, packets/second] of the total traffic requested by mobile terminals 130 including all data download requests or data upload requests.

In S320, the queue controller 210 may determine the traffic intensity associated with each priority queue (λ₁, λ₂ . . . , λ_(N)) and a delay metric associated with each priority queue. The delay metric associated with each priority queue may be based on the traffic intensities associated with each priority queue (λ₁, λ₂ . . . , λ_(N)) and may indicate a delay time for an individual data request to be serviced. For example, the total traffic intensity (λ_(b)) may be fragmented and its fractions may be assigned to four different priority queues (λ₁, λ₂, λ₃, λ₄). The sum of traffic intensities associated with each priority queue (λ₁, λ₂, λ₃, λ₄) is equal to the total traffic intensity (λ_(b)). Based on the traffic intensity associated with each priority queue (λ₁, λ₂, λ₃, λ₄), each priority queue may have a different delay metric.

The queue controller 210 may determine the traffic intensity associated with each priority queue (λ₁, λ₂ . . . , λ_(N)) using the equations discussed below with reference to Pricing Policies 1-3. Pricing Policies 1-3 may relate a delay metric associated with each priority queue with a price of service, as will be discussed in greater detail below. The traffic intensity associated with each priority queue (λ₁, λ₂ . . . , λ_(N)) may be used as the optimal distribution of mobile terminals 130 amongst the priority queues. For example, the queue controller 210 may determine that the optimal distribution of mobile terminals 130 among the priority queues is 40% in a first priority queue, 27% in a second priority queue, 19% in a third priority queue and 14% in a fourth priority queue based on the traffic intensity associated with each priority queue (λ₁, λ₂ . . . , λ_(N)) calculated using Pricing Policy 2. Use of the determined optimized distribution will be discussed in greater detail below with reference to S420 of FIG. 4.

The queue controller 210 may determine the delay metric associated with each priority queue using the equations discussed below with reference to Pricing Policies 1-3. The delay metric may depend on which Pricing Policy is chosen. For example, the delay metric may depend on a waiting time and a service time. The waiting time is the time spent by a data request in the buffer 220 prior to being serviced, while the service time is the time required for servicing the data request. The waiting time may depend on a quantity of data requests already in the priority queue and a quantity of data requests in any priority queue with a higher priority.

The queue controller 210 may determine the traffic intensity associated with each priority queue (λ₁, λ₂ . . . , λ_(N)) and the delay metric associated with each priority queue periodically. For example, the queue controller 210 may determine these values every 10 milliseconds, or the queue controller 210 may determine these values every 10 minutes. However, example embodiments are not limited thereto, and the queue controller 210 may use any desired period of time. Alternatively, the queue controller 210 may determine these values dynamically. However, determining the values dynamically may consume more resources.

As an alternative to using the equations of Pricing Policies 1-3, the queue controller 210 may determine the traffic intensity associated with each priority queue (λ₁, λ₂ . . . , λ_(N)) and the delay metric associated with each priority queue based on a lookup table. For example, the equations discussed below with reference to Pricing Policies 1-3 may be solved and quantized in the lookup table, and the queue controller 210 may find the traffic intensity associated with each priority queue (λ₁, λ₂ . . . , λ_(N)) in the lookup table based on the total traffic intensity (λ_(b)). As an example, the lookup table may have a range of traffic intensities (λ_(b)) from 1 MB/s to 100 MB/s, with a quantization of 1 MB/s. Thus, if the total traffic intensity (λ_(b)) is between 1-2 MB/s, the lookup table may provide a first traffic intensity associated with each priority queue (λ₁, λ₂ . . . , λ_(N)), whereas if the total traffic intensity (λ_(b)) is between 2-3 MB/s, the lookup table may provide a second traffic intensity associated with each priority queue (λ₁, λ₂ . . . , λ_(N)).

FIG. 4 is a flowchart illustrating a method of assigning each mobile terminal to a priority queue, according to some example embodiments.

The queue controller 210 may receive an expected quality of service in S410. For example, the queue controller 210 may receive an expected quality of service from any active mobile terminals 130 connected to the base station 110. A list of all active mobile terminals 130 connected to the base station 110 may be included in a table stored in the base station 110. Alternatively, the list of all active mobile terminals may be included in a table stored elsewhere in the wireless network 100.

The expected quality of service may be included with a data request or data packet from the mobile terminal 130 or the expected quality of service may be received separately and associated with the mobile terminal 130. The expected quality of service may be based on the data plan associated with the mobile terminal 130 or, alternatively, the expected quality of service may be based on a requested priority from a user of the mobile terminal 130, the requested priority being two or more priority levels that the user selects.

The expected quality of service may be, for example, a unitless value that the mobile terminal 130 requests indicating a level of importance of shorter delays or lower prices. For example, the expected quality of service associated with the mobile terminal 130 may be a priority level from 1-100 based on the price a user of the mobile terminal 130 wants to pay or the delay the user wants to experience, with 1 indicating the mobile terminal 130 is requesting the longest delay (and therefore lowest price) and 100 indicating the mobile terminal 130 is requesting the shortest delay (and therefore highest price). Example embodiments are not limited thereto, and the expected quality of service may be based on other methods. The process of setting prices for service is discussed in greater detail below with regard to Pricing Policies 1-3.

If the expected quality of service is based on the data plan associated with the mobile terminal 130, all data packets or data requests from the mobile terminal 130 may have the same expected quality of service. Therefore, the queue controller 210 may associate this expected quality of service with every data request from the mobile terminal 130.

Alternatively, if the expected quality of service is based on a requested priority from a user of the mobile terminal 130, the user may dynamically change the expected quality of service based on a desired speed or importance. For example, the user may request a higher priority while performing work-related functions and request a lower priority while performing non-work-related functions. The higher priority will offer faster service, but the price of service will increase accordingly, as discussed below with regard to Pricing Policies 1-3.

The queue controller 210 may store the expected quality of service associated with the mobile terminal 130 or communicate with the base station 110 or another device to determine the expected quality of service associated with the mobile terminal 130.

In S420, the queue controller 210 may assign a first mobile terminal 130 to a priority queue based on the measurement of total traffic intensity (λ_(b)) and the expected quality of service for the first mobile terminal 130. For example, the queue controller 210 may use the traffic intensity associated with each priority queue (λ₁, λ₂ . . . , λ_(N)) determined in S320 as the optimal distribution of mobile terminals 130 amongst the priority queues. Based on the optimal distribution of mobile terminals 130 amongst the priority queues, the queue controller 210 may compare the expected quality of service for the first mobile terminal 130 with the expected quality of service of all other mobile terminals 130 to assign the first mobile terminal 130 to a particular priority queue.

As an example, the queue controller may determine that the optimal distribution of mobile terminals 130 has 40% of the mobile terminals 130 assigned to a first priority queue. If the expected quality of service for the first mobile terminal 130 is in the top 30% of expected quality of services for all active mobile terminals 130, the queue controller 210 may assign the first mobile terminal 130 to the first priority queue. However, if the expected quality of service for the first mobile terminal 130 is between 50% and 60% of expected quality of services for all active mobile terminals 130, the queue controller 210 may assign the first mobile terminal 130 to the second priority queue.

As discussed above with reference to S320 and below with reference to Pricing Policies 1-3, the optimal distribution of mobile terminals 130 may vary depending on the Pricing Policy or delay metric used. Moreover, the queue controller 210 may rely on a lookup table or may calculate the traffic intensity associated with each priority queue (λ₁, λ₂ . . . , λ_(N)) as necessary. If using the lookup table, the queue controller 210 may use the measured total traffic intensity (λ_(b)) to determine the optimal distribution and/or an estimated delay metric associated with each priority queue.

FIG. 5 is a flowchart illustrating a method of placing a data request from a mobile terminal 130 in the assigned priority queue associated with the mobile terminal 130, according to some example embodiments.

In S510, the queue controller 210 may receive a data request from a mobile terminal 130, such as a data request to upload or download packets of data. In S520, the queue controller may place the data request in the priority queue associated with the mobile terminal 130 that was assigned in S420. In S530, the buffer 220 may service the data request based on the assigned priority queue.

As an example, the queue controller 210 may receive a first data request from a first mobile terminal 130. Previously, the first mobile terminal 130 was assigned to the first priority queue. Therefore, the queue controller 210 may place the first data request from a first mobile terminal 130 in a first priority queue in the buffer 220. As the first data request is in the first priority queue, as soon as all data request already in the first priority queue are serviced, the buffer 220 may service the first data request.

As discussed above referring to S410, a user of the mobile terminal 130 may dynamically change the expected quality of service based on a desired speed or importance. Therefore, the queue controller 210 may receive a second expected quality of service from the mobile terminal 130 and assign the mobile terminal 130 to a second priority queue simultaneously to performing steps S510-S530. Thus, a second data request from the first mobile terminal 130 may be placed in a different priority queue from the first data request from the first mobile terminal 130. Example embodiments are not limited thereto, and the queue controller 210 may move the first data request to a different priority queue if the mobile terminal 130 is assigned to a different priority queue prior to the first request being serviced by the buffer 220.

The buffer 220 may service the data requests or packets based on the priority queue. For example, the buffer 220 may service all of the data requests or packets in the first priority queue before servicing any of the data requests or packets in the second priority queue. Similarly, the buffer 220 may service all of the data requests or packets in the second priority queue before servicing any of the data requests or packets in the third priority queue. In this way, the higher the priority queue, the faster the buffer 220 services the data requests or packets. This is an example of a delay system.

Due to the delay system, a delay metric associated with each priority queue may depend on the total traffic intensity (λ_(b)) and may indicate a delay time for an individual data request to be serviced. For example, because the buffer 220 may service all of the data requests or packets in the first priority queue before servicing any of the data requests or packets in the second priority queue, the second priority queue has a delay due to the number of data requests or packets currently in the first priority queue and any additional data requests or packets received by the first priority queue. If the total traffic intensity (λ_(b)) is low, which may reflect a low load on the base station 110, the last priority queue may have a delay metric similar to a delay metric for the first priority queue. However, if the total traffic intensity (λ_(b)) is high, which may reflect a high load on the base station 110, the last priority queue may have a delay metric far longer than a delay metric for the first priority queue.

Non-preemptive priority queues or preemptive priority queues may be used. In a priority queue, a data request with a lower priority is only serviced when there are no data requests with higher priority levels in the system. In a non-preemptive system, once the processing of a data request or packet has started, it is allowed to carry on to completion even if data requests or packets of higher priority levels arrive at the system in the intervening time. In contrast, a preemptive system immediately suspends the processing of a lower-priority data request or packet when a high-priority data request or packet arrives.

As discussed above, a delay metric associated with each priority queue may be used to calculate a price of service for each data request, as will be discussed in greater detail below with reference to Pricing Policies 1-3.

Price of Service

The computer system 120 may translate typical network quantities, such as delay in seconds, into monetary quantities, such as $/packet or $/Byte. To calculate the price of service, the computer system 120 may consider revenue to be proportional to a total traffic intensity (λ_(b)) over the delay metric, the total traffic intensity (λ_(b)) being the rate at which data requests arrive at the computer system 120.

$\begin{matrix} {{Revenue} \propto \frac{{Total}\mspace{14mu}{Traffic}\mspace{14mu}{{Intensity}\left( \lambda_{b} \right)}}{{Delay}\mspace{14mu}{Metric}}} & (1) \end{matrix}$

Similarly, the queue controller 210 may determine a price of service of a single data request based on the delay metric of the assigned priority queue for the data request. For example, the computer system 120 may use an inverse pricing function, such as P_(n)=c/T_(n), where P_(n) is the price for a packet [$/packet], T_(n) is the delay [seconds] and c is a scalar constant for the network [$·seconds/packet].

Below, three examples of potential pricing policies used to define the delay metric and corresponding price of service for each data request are discussed. Based on the pricing policy, the distribution of the mobile terminals 130 in each priority queue may be optimized to increase revenue or reduce future network expansion costs by reducing bandwidth from the baseline system. For example, the traffic intensity associated with each priority queue (λ₁, λ₂ . . . , λ_(N)) calculated using Pricing Policies 1-3 may be used as the optimal distribution of mobile terminals 130 amongst the priority queues. The optimized distributions are determined in S320 and used in S420, discussed in greater detail above.

Pricing Policy 1—Average Delay:

For example, the average delays T_(n) for the priority queues n=1, 2, . . . , N may be used as the delay metrics. As is discussed in greater detail below with reference to equations (2)-(8), the queue controller 210 may determine the average delay based on an average service time and an average waiting time.

Pricing Policy 2—P-th percentile Delay:

For example, the P-th percentile delays T_(n) ^(P) for the priority queues n=1, 2, . . . , N may be used as the delay metrics. As is discussed in greater detail below with reference to equations (10)-(14), the queue controller 210 may determine the P-th percentile delay based on a probability approximation, which may include a waiting time for the priority queues, a number of components of a probability mass function (PMF) associated with a service time, and a resolution of the PMF associated with the service time.

Pricing Policy 3—Percentile Truncated Average Delay:

For example, the truncated average delays T _(n) ^(P) for the priority queues n=1, 2, . . . , N up to the P-th percentile delay of the baseline non-prioritized system may be used as the delay metrics. As is discussed in greater detail below with reference to equations (15)-(21), the queue controller 210 may determine the percentile truncated average delay approximation based on a waiting time for the priority queues and a delay for the non-prioritized system.

Pricing Policy 1: Average Delay

From a network operator perspective, the variables that can be controlled to achieve the desired results with respect to revenues are the fractional traffic intensities (λ₁, λ₂ . . . , λ_(N)) impinging on the distinct priority queues. Therefore, Policy 1 uses the average delays T₁, T₂, . . . , T_(N) as variables of the utility function:

maximize ⁢ ⁢ U ⁡ ( λ 1 , … ⁢ , λ N , T 1 , … ⁢ , T N ) = c ⁢ ∑ i = 1 N ⁢ λ i T i ︷ Revenue ⁢ ⁢ over ( 2 ) λ n ∈ + , n = 1 , … ⁢ , N ⁢ ⁢ subject ⁢ ⁢ to ⁢ : ( 3 ) ∑ i = 1 N ⁢ λ i = λ b ( 4 ) B = α · B b , 0 < α ≤ 1 ( 5 ) T n = X n _ ︸ Average ⁢ ⁢ Service ⁢ ⁢ Time + ∑ i = 1 N ⁢ ( λ i · X i 2 _ ) 2 · [ 1 - ∑ i = 1 n - 1 ⁢ ( λ i · X i _ ) ] · [ 1 - ∑ i = 1 n ⁢ ( λ i · X i _ ) ] ︸ Average ⁢ ⁢ Waiting ⁢ ⁢ Time , ⁢ n = 1 , … ⁢ , N , ( 6 ) X n _ = ⁢ { F B · S } = F B · ⁢ { 1 S } , n = 1 , … ⁢ , N ( 7 ) X n 2 _ = ⁢ { ( F B · S ) 2 } = F 2 B 2 · ⁢ { ( 1 S ) 2 } , n = 1 , … ⁢ , N . ( 8 )

In the policy above,

+ indicates all positive real numbers, E indicates the Expectation value, λ_(b) is the total traffic intensity in the non-prioritized baseline system in packets per second, B_(b) is the bandwidth of the non-prioritized baseline system, α is a bandwidth scaling parameter that connects the bandwidth of the prioritized system B to the bandwidth of non-prioritized baseline system B_(b), c is a constant that turns the proportionality relation in (1) to an equality and is connected to the network costs, F=320 KB is the size of the file being downloaded (which may be considered constant) and S is a random variable representing the spectral efficiency of the wireless network which may be determined by simulation. For example, simulating a two-ring, tree-sectored wireless network with wraparound, path-loss and shadow fading, uniform mobile terminal 130 distribution, and full-power transmission at each sector, we obtain:

$\begin{matrix} {{\left\{ \frac{1}{S} \right\}} \approx {0.81\mspace{14mu}{and}\mspace{14mu}\left\{ \left( \frac{1}{S} \right)^{2} \right\}} \approx {1.04.}} & (9) \end{matrix}$

However, these equations are provided to illustrate a particular example embodiment. Example embodiments may vary and are not limited to the equations shown above.

A baseline load ρ_(b) can be solved for using equation (7) above:

$\rho_{b} = {\lambda_{b}{\left\{ \frac{F}{B_{b} \cdot S} \right\}.}}$ For a certain baseline load ρ_(b), any total traffic intensity (λ_(b)) can be made possible by properly scaling the baseline bandwidth B_(b).

As shown above in equation (6), the average delay time is based on an average service time X_(n) and an average waiting time. The average service time generally depends on the mobile terminal's 130 file size and realized Signal to Interference plus Noise Ratio (SINR). The SINR distribution may be induced by a wireless network geometry incorporating path loss and shadow fading, and is represented by the random variable S in the equations above.

If the pricing policy is based on average delay, such as Pricing Policy 1, uniform allocation may perform nearly as well as an optimal distribution calculated using the equations above. Therefore, the queue controller 210 may use a uniform allocation to distribute mobile terminals 130 amongst the priority queues, instead of calculating the traffic intensity associated with each priority queue (λ₁, λ₂ . . . , λ_(N)). This may simplify S320 above, for example, as the queue controller 210 may evenly distribute the total traffic intensity (λ_(b)) into each priority queue.

Policy 2: P-th percentile Delay

In Policy 2, the fractional traffic intensities (λ₁, λ₂ . . . , λ_(N)) and the P-th percentile delays T₁ ^(P), T₂ ^(P), . . . , T_(N) ^(P) are variables of the utility function:

⁢maximize ⁢ ⁢ U ⁡ ( λ 1 , … ⁢ , λ N , T 1 P , … ⁢ , T N P ) = c ⁢ ∑ i = 1 N ⁢ λ i T i P ︷ Revenue ⁢ ⁢ ⁢ over ( 10 ) ⁢ λ n ∈ + , n = 1 , … ⁢ , N ⁢ ⁢ ⁢ subject ⁢ ⁢ to ⁢ : ( 11 ) ⁢ ∑ i = 1 N ⁢ λ i = λ b ( 12 ) ⁢ B = α · B b , 0 < α ≤ 1 ( 13 ) ∑ i = 0 m ⁢ ⁢ i ⁢ ⁢ n ⁢ { ⌊ T n P Δ ⁢ ⁢ i ⌋ , N S } ⁢ ( p S ⁡ [ i ] · ( 1 - ρ n ⁢ⅇ - γ n · ( T n P - ⅈ · Δ ⁢ ⁢ t ) ) ) ︸ Probability ⁢ ⁢ Approximation ⁢ ⁢ for ⁢⁢the ⁢ ⁢ P ⁢ - ⁢ th ⁢ ⁢ Percentile ⁢ ⁢ Delay ≈ ⁢ { τ n ≤ T n P } = P , ⁢ ⁢ for ⁢ ⁢ n = 1 , 2 , … ⁢ , N ( 14 )

In Policy 2,

+ indicates all positive real numbers and c, λ_(b), B, α and B_(b) are as defined in Policy 1. Moreover, p_(S)[i], N_(S) and Δt are, respectively, the components, a number of components of a probability mass function (PMF) associated with a service time, and a resolution of the PMF associated with the service time. p_(S)[i], N_(S) and Δt may be measured beforehand and stored in the computer system according to known methods.

The PMF of the service time may be computed using the simulation of the wireless network as described in Policy 1. τ_(p) is the random variable representing the delay for queue n. As an example, if the P-th percentile delay is the 95-th percentile delay, P=0.95 in equation (14).

The variables γ_(n) and ρ_(n) are calculated using statistics of the waiting time, for example using equations (27) and (28), which are discussed in greater detail below. For example, the probability density function (PDF) and cumulative distribution function (CDF) of the waiting times may be approximated by degenerated hyper-exponential distribution functions, as discussed below with regard “Statistics of the Waiting Time.”

Policy 3: Percentile Truncated Average Delay

In Policy 3, the fractional traffic intensities (λ₁, λ₂ . . . , λ_(N)), the average delays T ₁ ^(T) ⁰ , T ₂ ^(T) ⁰ , . . . , T _(N) ^(T) ⁰ of the prioritized queuing system truncated to the P₀-th percentile delay T₀ of the non-prioritized baseline system, and fractions of traffic P₁, P₂, . . . , P_(N) in each priority queue with delays less than T₀ are variables of the utility function:

maximize ⁢ ⁢ U ⁡ ( λ 1 , … ⁢ , λ N , P 1 , … ⁢ , P N , T _ 1 T 0 , … ⁢ , T _ N T 0 ) = c ⁢ ∑ i = 1 N ⁢ P i ⁢ λ i T _ i T 0 ︷ Revenue ⁢ ⁢ over ( 15 ) λ n ∈ + , n = 1 , … ⁢ , N ⁢ ⁢ subject ⁢ ⁢ to ⁢ : ( 16 ) ∑ i = 1 N ⁢ λ i = λ b ( 17 ) B = α · B b , 0 < α ≤ 1 ( 18 ) ∑ i = 0 m ⁢ ⁢ i ⁢ ⁢ n ⁢ { ⌊ T 0 Δ ⁢ ⁢ t ⌋ , N S } ⁢ ( p S ⁡ [ i ] · ( 1 - ρ 0 ⁢ ⅇ - γ 0 · ( T 0 - ⅈ · Δ ⁢ ⁢ t ) ) ) ︸ Probability ⁢ ⁢ Approximation ⁢ ⁢ for ⁢ ⁢ the ⁢ ⁢ P 0 ⁢ - ⁢ th ⁢ ⁢ Percentile ⁢ ⁢ Delay ≈ ⁢ { τ 0 ≤ T 0 } = P 0 , ⁢ ( Non ⁢ - ⁢ prioritized , baseline ⁢ ⁢ system ) ( 19 ) P n = ⁢ { τ n ≤ T 0 } ≈ ∑ i = 0 m ⁢ ⁢ i ⁢ ⁢ n ⁢ { ⌊ T 0 Δ ⁢ ⁢ t ⌋ , N S } ⁢ ( p S ⁡ [ i ] · ( 1 - ρ n ⁢ ⅇ - γ n · ( T 0 - ⅈ · Δ ⁢ ⁢ t ) ) ) ︸ Probability ⁢ ⁢ Approximation ⁢ ⁢ for ⁢ ⁢ the ⁢ ⁢ P n ⁢ - ⁢ th ⁢ ⁢ Percentile ⁢ ⁢ Delay , ⁢ n = 1 , 2 , … ⁢ , N , ( 20 ) T _ n T 0 ⁢ ≈ ∑ i = 0 m ⁢ ⁢ i ⁢ ⁢ n ⁢ { ⌊ T _ 0 Δ ⁢ ⁢ t ⌋ , N S } ⁢ [ p S , n ⁡ [ i ] · ( ( 1 - ρ n ) · i · Δ ⁢ ⁢ t 1 - ρ n ⁢ ⅇ - γ n · ( T 0 - ⅈ · Δ ⁢ ⁢ t ) + … + ρ n · ( 1 + γ n · i · Δ ⁢ ⁢ t - ( 1 + γ n ⁢ T 0 ) · ⅇ - γ n · ( T 0 - ⅈ · Δ ⁢ ⁢ t ) ) γ n · ( 1 - ρ n ⁢ ⅇ - γ n · ( T 0 - ⅈ · Δ ⁢ ⁢ t ) ) ) ] ︸ Truncated ⁢ ⁢ Average ⁢ ⁢ Approximation , ⁢ n = 1 , … ⁢ , N . ( 21 )

In Policy 3, c, λ_(b), B, α, B_(b), p_(S)[i], N_(S), Δt, γ_(n), ρ_(n) and τ_(n) are as defined in Policy 1 and Policy 2. The variables γ₀ and ρ₀ are also calculated using statistics of the waiting time as discussed in Policy 2, and τ₀ is the random variable representing the delay for the baseline, non-prioritized system. As an example, if the P₀-th percentile delay is the 95-th percentile delay, P₀=0.95 in equation (19).

While example embodiments have been particularly shown and described, it will be understood by one of ordinary skill in the art that variations in form and detail may be made therein without departing from the spirit and scope of the claims.

Statistics of the Waiting Time

As discussed above, the variables γ_(n) and ρ_(n) are calculated using statistics of the waiting time. For example, the probability density function (PDF) and cumulative distribution function (CDF) of the waiting times may be approximated by degenerated hyper-exponential distribution functions. However, example embodiments may vary and are not limited to the example embodiments discussed above.

The Laplace-Stieltjes transform (LST) of the waiting time distribution function may be given by:

$\begin{matrix} {{W*(s)} = \frac{\left( {1 - {\lambda\; s}} \right)s}{s - \lambda + {\lambda\; B*(s)}}} & (22) \end{matrix}$

A closed form expression for the waiting time LST requires the service time LST, B*(s), to be known. Unfortunately, an analytically tractable expression for B*(s) is not known. Hence, we may use degenerated hyper-exponential distributions to approximate the PDF and CDF for the waiting time. However, as discussed above, these equations are provided to illustrate a particular example embodiment. Example embodiments may vary and are not limited to the equations shown.

In this sense, we have for the PDF:

$\begin{matrix} {{f_{W}(w)} = \left\{ {\begin{matrix} {{{\left( {1 - \rho} \right){\delta(w)}} + {\rho\;\gamma\;{\mathbb{e}}^{{- \gamma}\; w}}},} & {{{for}\mspace{14mu} w} \geq 0} \\ {0,} & {elsewhere} \end{matrix},{{{with}\mspace{14mu}\gamma} \geq {0\mspace{14mu}{and}\mspace{14mu} 0} \leq \rho \leq 1},} \right.} & (23) \end{matrix}$

where δ(w) is the Dirac delta function.

Moreover, the CDF is given by

$\begin{matrix} {{F_{W}(t)} = {{P\left\{ {W \leq t} \right\}} = {{\int_{0}^{t}{{f(w)}{\mathbb{d}w}}} = \left\{ {\begin{matrix} {{1 - {\rho\mathbb{e}}^{{- \gamma}\; t}},} & {{{for}\mspace{14mu} t} \geq 0} \\ {0,} & {elsewhere} \end{matrix},{{with}\mspace{14mu} t},{\gamma \geq {0\mspace{14mu}{and}\mspace{14mu} 0} \leq \rho \leq 1},} \right.}}} & (24) \end{matrix}$

In a prioritized system, the PDF and CDF of the waiting times of each class n=1, 2, . . . N can be modeled by a version of the functions above, i.e.:

$\begin{matrix} {{f_{W,n}(w)} = \left\{ {\begin{matrix} {{{\left( {1 - \rho_{n}} \right){\delta(w)}} + {\rho_{n}\gamma_{n}{\mathbb{e}}^{{- \gamma_{n}}w}}},} & {{{for}\mspace{14mu} w} \geq 0} \\ {0,} & {elsewhere} \end{matrix},{{{with}\mspace{14mu}\gamma_{n}} \geq {0\mspace{14mu}{and}\mspace{14mu} 0} \leq \rho_{n} \leq 1},} \right.} & (25) \\ {{F_{W,n}(t)} = {{P\left\{ {W_{n} \leq t} \right\}} = \left\{ {\begin{matrix} {{1 - {\rho_{n}{\mathbb{e}}^{{- \gamma_{n}}t}}},} & {{{for}\mspace{14mu} t} \geq 0} \\ {0,} & {elsewhere} \end{matrix},{{{with}\mspace{14mu}\gamma_{n}} \geq {0\mspace{14mu}{and}\mspace{14mu} 0} \leq \rho_{n} \leq 1},} \right.}} & (26) \end{matrix}$

where the parameters and can be computed as

$\begin{matrix} {{\rho_{n} = \frac{2{\overset{\_}{W_{n}}}^{2}}{\overset{\_}{W_{n}^{2}}}},{and}} & (27) \\ {{\gamma_{n} = \frac{2\overset{\_}{W_{n}}}{\overset{\_}{W_{n}^{2}}}},} & (28) \end{matrix}$

where W_(n) is the average waiting time for class n, and the second moment of the waiting time W_(n) ² for class is given as:

$\begin{matrix} {{\overset{\_}{W_{n}^{2}} = {\frac{\sum\limits_{i = 1}^{N}{\lambda_{i}\overset{\_}{X_{i}^{3}}}}{3\left( {1 - {\sum\limits_{i = 1}^{n - 1}{\lambda_{i}\overset{\_}{X_{i}}}}} \right)^{2}\left( {1 - {\sum\limits_{i = 1}^{n}{\lambda_{i}\overset{\_}{X_{i}}}}} \right)} + \frac{\left( {\sum\limits_{i = 1}^{n}{\lambda_{i}\overset{\_}{X_{i}^{2}}}} \right)\left( {\sum\limits_{i = 1}^{N}{\lambda_{i}\overset{\_}{X_{i}^{2}}}} \right)}{2\left( {1 - {\sum\limits_{i = 1}^{n - 1}{\lambda_{i}\overset{\_}{X_{i}}}}} \right)^{2}\left( {1 - {\sum\limits_{i = 1}^{n}{\lambda_{i}\overset{\_}{X_{i}}}}} \right)^{2}} + \frac{\left( {\sum\limits_{i = 1}^{n - 1}{\lambda_{i}\overset{\_}{X_{i}^{2}}}} \right)\left( {\sum\limits_{i = 1}^{N}{\lambda_{i}\overset{\_}{X_{i}^{2}}}} \right)}{2\left( {1 - {\sum\limits_{i = 1}^{n - 1}{\lambda_{i}\overset{\_}{X_{i}}}}} \right)^{3}\left( {1 - {\sum\limits_{i = 1}^{n}{\lambda_{i}\overset{\_}{X_{i}}}}} \right)}}},} & (29) \end{matrix}$

and X_(n) , X_(n) ² , X_(n) ³ , n=1, . . . , N, are, respectively, the averages, the second moments and the third moments of the service times. Moreover, X_(n) , X_(n) ² , n=1, . . . , N, are given by equations (7) and (8), respectively, and X_(n) ³ is given as:

$\begin{matrix} {{\overset{\_}{X_{n}^{3}} = {{\left\{ \left( \frac{F}{B \cdot S} \right)^{3} \right\}} = {{\frac{F^{3}}{B^{3}} \cdot}\left\{ \frac{1^{3}}{S} \right\}}}},{n = 1},\ldots\mspace{14mu},N,{with}} & (30) \\ {{\left\{ \frac{1^{3}}{S} \right\}} \approx 7.75} & (31) \end{matrix}$

However, these equations are provided to illustrate a particular example embodiment. Example embodiments may vary and are not limited to the equations shown above.

Observe that the conditions γ_(n)≧0 and 0≦ρ_(n)≦1 implies that 2 {overscore (W _(n))}²≦ W _(n) ² , W _(n) ≧0 and W _(n) ²≧0,ƒ or n=1,2, . . . ,N  (32)

must be fulfilled in order for the approximation by the degenerated hyper-exponential distribution to be valid. Moreover, choosing γ_(n) and ρ_(n) as (27) and (28), respectively, the average and second moment of ƒ_(W,n)(w) are equal to W_(n) and W_(n) ² , n=1, . . . , N, respectively.

Because the delay is defined as T_(n)=(1/μ_(n))+W_(n) (i.e., the sum of waiting time and service time), its PDF is given by the convolution of the waiting time PDF with the service time distribution function: ƒ_(T,n)(t)=ƒ_(W,n(t))*ƒ_(S,n)(t).  (33)

For numeric computation purposes, we consider the PDF of the service time to be sampled to a discrete probability mass function (PMF) of sufficient resolution, i.e.: P _(S,n)(k·Δt)=Σ_(i=0) ^(N) ^(S) p _(S,n) [i]·δ(k−1),k=0,1, . . . ,N _(S),  (34)

where Δt is the time resolution of the PMF, N_(S) is the number of components in the PMF, δ(·) is the Dirac delta function and n=1, 2, . . . , N.

Hence, the convolution in equation (33) can be written as: ƒ_(T,n)(t)=Σ_(i=0) ^(N) ^(S) p _(S,n) [i]·ƒ _(W,n)(t−i·Δt).  (35)

Therefore, the CDF of the delays are given by:

$\begin{matrix} \begin{matrix} {{F_{T,n}(t)} = {P\left\{ {T_{n} \leq t} \right\}}} \\ {= {\int_{0}^{t}{{f_{T,n}(\tau)}{\mathbb{d}\tau}}}} \\ {= {\int_{0}^{t}{\sum\limits_{i = 0}^{m\; i\; n{\{{{\lbrack\frac{t}{\Delta\; t}\rbrack},N_{S}}\}}}{{{p_{S,n}\lbrack i\rbrack} \cdot {f_{W,n}\left( {\tau - {{i \cdot \Delta}\; t}} \right)}}{\mathbb{d}\tau}}}}} \\ {= {\sum\limits_{i = 0}^{m\; i\; n{\{{{\lbrack\frac{t}{\Delta\; t}\rbrack},N_{S}}\}}}\left( {{p_{S,n}\lbrack i\rbrack} \cdot {\int_{{i \cdot \Delta}\; t}^{t}{{f_{W,n}\left( {\tau - {{i \cdot \Delta}\; t}} \right)}{\mathbb{d}\tau}}}} \right)}} \\ {= {\sum\limits_{i = 0}^{m\; i\; n{\{{{\lbrack\frac{t}{\Delta\; t}\rbrack},N_{S}}\}}}\left( {{{p_{S,n}\lbrack i\rbrack} \cdot {\int_{{i \cdot \Delta}\; t}^{t}{\left( {1 - \rho_{n}} \right)\delta\left( {\tau - {{i \cdot \Delta}\; t}} \right)}}} +} \right.}} \\ \left. {\rho_{n}\gamma_{n}{\mathbb{e}}^{{- \gamma_{n}} \cdot {({\tau - {{{\mathbb{i}} \cdot \Delta}\; t}})}}{\mathbb{d}\tau}} \right) \\ {= {\sum\limits_{i = 0}^{m\; i\; n{\{{{\lbrack\frac{t}{\Delta\; t}\rbrack},N_{S}}\}}}\left( {{p_{S,n}\lbrack i\rbrack} \cdot \left( {\rho_{n}{\mathbb{e}}^{{- \gamma_{n}} \cdot {({t - {{{\mathbb{i}} \cdot \Delta}\; t}})}}} \right)} \right)}} \end{matrix} & (36) \end{matrix}$ 

What is claimed is:
 1. A method of operating a network, the method comprising: receiving a measurement of total traffic intensity; receiving an expected quality of service associated with a packet; determining a traffic intensity associated with respective ones of two or more priority queues and a delay metric associated with the respective ones of the two or more priority queues based on the measurement of total traffic intensity, the two or more priority queues having different delay metrics associated therewith; assigning the packet to one of the two or more priority queues based on the expected quality of service and the determined traffic intensity associated with the respective ones of the two or more priority queues; and determining a price of service for the packet based on the determined delay metric associated with the one of the two or more priority queues assigned to the packet.
 2. The method of claim 1, further comprising: switching the packet to the one of the two or more priority queues by a switch.
 3. The method of claim 1, further comprising: measuring the total traffic intensity, wherein the receiving receives the measured total traffic intensity.
 4. The method of claim 1, wherein the expected quality of service includes a requested priority from a user, the requested priority being one of two or more priority levels from which the user selects.
 5. The method of claim 4, wherein the receiving the expected quality of service associated with the packet further comprises: receiving an expected quality of service associated with a mobile terminal; receiving the packet from the mobile terminal; and associating the packet with the expected quality of service associated with the mobile terminal.
 6. The method of claim 1, wherein the delay metric is an average delay and the determining the price determines the average delay based on an average service time and an average waiting time.
 7. The method of claim 1, wherein the delay metric is a P-th percentile delay and the determining the price determines the P-th percentile delay based on a probability approximation including a waiting time for the one of two or more priority queues, a number of components of a probability mass function (PMF) associated with a service time, and a resolution of the PMF associated with the service time.
 8. The method of claim 1, wherein the delay metric is a truncated average delay approximation and the determining the price determines the truncated average delay approximation based on a waiting time for the one of two or more priority queues and a delay for the non-prioritized system.
 9. A network, the network comprising: a queue controller configured to, receive a measurement of total traffic intensity, receive an expected quality of service associated with a packet, determine a traffic intensity associated with respective ones of two or more priority queues and a delay metric associated with the respective ones of the two or more priority queues based on the measurement of total traffic intensity, the two or more priority queues having different delay metrics associated therewith, assign the packet to one of the two or more priority queues based on the expected quality of service and the determined traffic intensity associated with the respective ones of the two or more priority queues, and determine a price of service for the packet based on the determined delay metric associated with the one of two or more priority queues assigned to the packet.
 10. The network of claim 9, further comprising: a switch configured to switch the packet to the one of two or more priority queues.
 11. The network of claim 9, further comprising: a traffic intensity measurement device configured to measure the total traffic intensity and to transmit the total traffic intensity to the queue controller.
 12. The network of claim 9, wherein the expected quality of service includes a requested priority from a user, the requested priority being one of two or more priority levels from which the user selects.
 13. The network of claim 12, wherein the queue controller is further configured to receive an expected quality of service associated with a mobile terminal, to receive the packet from the mobile terminal and to associate the packet with the expected quality of service associated with the mobile terminal.
 14. The network of claim 9, wherein the delay metric is an average delay and the queue controller is configured to determine the average delay based on an average service time and an average waiting time.
 15. The network of claim 9, wherein the delay metric is a P-th percentile delay and the queue controller is configured to determine the P-th percentile delay based on a probability approximation including a waiting time for the one of two or more priority queues, a number of components of a probability mass function (PMF) associated with a service time, and a resolution of the PMF associated with the service time.
 16. The network of claim 9, wherein the delay metric is a truncated average delay approximation and the queue controller is configured to determine the truncated average delay approximation based on a waiting time for the one of two or more priority queues and a delay for the non-prioritized system. 